Modeling acoustic transitions in speech by state-interpolation hidden Markov models

Authors

Li Deng

Patrick Kenny

Matthew Lennig

Paul Mermelstein

Abstract

We present a new type of HMM for vowel-to-consonant (VC) and consonant-to-vowel (CV) transitions based on the locus theory of speech perception. The parameters of the model can be trained automatically using the Baum-Welch algorithm, and the training procedure does not require that instances of all possible CV and VC pairs be present. When incorporated into an isolated-word recognizer with a 75,000-word vocabulary, the model leads to a modest improvement in recognition rates.

Similar resources

Modeling Acoustic Transitions in Speech by Modified Hidden Markov Models with State Duration and Sta - Speech and Audio Processing, IEEE Transactions on

We propose a modified hidden Markov model (MHMM) that incorporates nonparametric state duration and state duration-dependent observation probabilities to reflect state transitions and to have accurate temporal structures in the HMM. In addition, to cope with the problem that results from the use of an insufficient amount of training data, we propose to use the modified continuous density hidden Ma...


Presentation of K Nearest Neighbor Gaussian Interpolation and comparing it with Fuzzy Interpolation in Speech Recognition

The hidden Markov model is a popular statistical method used in continuous and discrete speech recognition. The probability density function of the observation vectors in each state is estimated with discrete-density or continuous-density modeling. The performance (in correct word recognition rate) of the continuous-density HMM is higher than that of the discrete-density HMM, but its computational complexity is very ...


Discriminative training of stochastic Markov graphs for speech recognition

This paper proposes the application of discriminative training techniques based on the Generalized Probabilistic Descent (GPD) approach to Stochastic Markov Graphs (SMGs), a generalization of mixture-state Hidden Markov Models (HMMs), describing the constraints in the acoustic structure of speech as a graph consisting of nodes, each containing a base function, and a transition network between t...


Synthesis of fast speech with interpolation of adapted HSMMs and its evaluation by blind and sighted listeners

In this paper we evaluate a method for generating synthetic speech at high speaking rates based on the interpolation of hidden semi-Markov models (HSMMs) trained on speech data recorded at normal and fast speaking rates. The subjective evaluation was carried out with both blind listeners, who are used to very fast speaking rates, and sighted listeners. We show that we can achieve a better intel...


Journal:

IEEE Trans. Signal Processing

Volume 40

Publication date: 1992
